10 links tagged with machine learning
Links
Researchers from Carnegie Mellon University have developed a vector-based automated tuning system called Proto-X for PostgreSQL databases, which can enhance performance by two to ten times. By utilizing a holistic tuning approach and an LLM booster, the system can significantly reduce the time needed for optimization from 12 hours to about 50 minutes, making database management easier for developers with less experience.
The article presents DRIFT (Dissatisfaction-Refined Iterative Preference Training), a novel approach to preference learning that utilizes abundant implicit user dissatisfaction signals from real-world applications like conversational AI and code generation. By focusing on these dissatisfaction signals and dynamically sampling positive feedback, DRIFT improves performance on various benchmarks, surpassing existing methods and preserving exploratory capabilities in model training.
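To make the idea concrete, here is a minimal DPO-style sketch of training on dissatisfaction signals, where the rejected response is one a user was dissatisfied with and the positive is sampled from the current policy. The function name and exact formulation are illustrative assumptions, not the paper's objective:

```python
import torch
import torch.nn.functional as F

def drift_style_loss(policy_logp_pos, policy_logp_neg,
                     ref_logp_pos, ref_logp_neg, beta=0.1):
    """Preference loss over (sampled positive, dissatisfaction negative)
    pairs. The 'negative' is a logged response a real user disliked; the
    'positive' is dynamically sampled from the current policy rather than
    drawn from a fixed preference dataset. (Sketch, not the paper's exact
    objective.) All inputs are per-example log-probabilities."""
    # Log-ratio of policy to reference model for each side of the pair.
    pos_ratio = policy_logp_pos - ref_logp_pos
    neg_ratio = policy_logp_neg - ref_logp_neg
    # Bradley-Terry-style margin loss on the log-ratios.
    return -F.logsigmoid(beta * (pos_ratio - neg_ratio)).mean()
```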
The article presents "Antislop," a framework designed to identify and eliminate repetitive patterns, or "slop," in language models that degrade text quality. It introduces three innovative tools: the Antislop Sampler for suppressing unwanted phrases, an automated profiling pipeline, and Final Token Preference Optimization (FTPO) for fine-tuning token logits, achieving significant slop reduction while maintaining or enhancing performance across various evaluation tasks.
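The core mechanism behind phrase suppression can be sketched as logit masking during decoding. The helper below is a hypothetical simplification, assuming banned phrases are given as token-id sequences; the actual Antislop Sampler is more sophisticated:

```python
import torch

def suppress_banned_continuations(input_ids, logits, banned_seqs):
    """Mask any next token that would complete a banned phrase.
    input_ids:   LongTensor [seq_len], tokens generated so far.
    logits:      FloatTensor [vocab_size], next-token scores.
    banned_seqs: list of token-id lists for phrases to suppress.
    (Illustrative sketch of logit-level phrase suppression only.)"""
    for seq in banned_seqs:
        prefix, last = seq[:-1], seq[-1]
        n = len(prefix)
        # If the generated suffix matches everything but the phrase's
        # final token, forbid the token that would finish the phrase.
        if n == 0 or input_ids[-n:].tolist() == prefix:
            logits[last] = float("-inf")
    return logits
```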
The article introduces PyTorch Monarch, a new distributed programming framework designed to reduce the complexity of distributed machine learning workflows. By adopting a single-controller model, Monarch lets developers program clusters as if they were single machines, integrating with PyTorch while managing processes and actors efficiently across large GPU clusters. It aims to improve fault handling and data transfer, making distributed computing more accessible and efficient for ML applications.
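The single-controller idea can be illustrated with plain Python multiprocessing: one driver script orchestrates every worker, instead of each rank running its own copy of the program. This is a toy illustration of the pattern only, not Monarch's actual API:

```python
import multiprocessing as mp

def worker(rank, conn):
    """Each 'actor' waits for messages from the single controller."""
    while True:
        msg = conn.recv()
        if msg == "stop":
            break
        conn.send(f"rank {rank} ran step {msg}")  # pretend training step

if __name__ == "__main__":
    # One controller process drives all workers, as if the cluster
    # were a single machine (toy sketch, not Monarch's API).
    conns, procs = [], []
    for rank in range(4):
        parent, child = mp.Pipe()
        p = mp.Process(target=worker, args=(rank, child))
        p.start()
        conns.append(parent)
        procs.append(p)
    for step in range(2):
        for c in conns:               # broadcast a step to every actor
            c.send(step)
        print([c.recv() for c in conns])
    for c in conns:
        c.send("stop")
    for p in procs:
        p.join()
```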
The article analyzes the state of machine learning frameworks in 2019, highlighting a significant shift towards PyTorch among researchers while TensorFlow remains dominant in industry applications. It presents data showing PyTorch's rapid adoption in major research conferences, citing reasons such as simplicity, a better API, and performance. The future for TensorFlow in research appears uncertain as PyTorch solidifies its majority status within the community.
The article investigates the limitations of Transformers in performing multi-digit multiplication, revealing that while these models can encode necessary long-range dependencies, they often converge to local optima that fail to utilize them effectively. The authors propose an auxiliary loss to enhance learning dynamics and successfully address the issue of learning long-range dependencies in Transformers.
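The general recipe of adding an auxiliary loss to reshape learning dynamics looks like the sketch below. The auxiliary target here is a placeholder for intermediate supervision (e.g., partial results); the paper defines its own specific target:

```python
import torch
import torch.nn.functional as F

def combined_loss(main_logits, main_targets, aux_logits, aux_targets,
                  lam=0.5):
    """Main objective plus a weighted auxiliary loss on intermediate
    quantities, intended to steer optimization away from shortcut local
    optima. Logits are [batch, classes]; targets are [batch] class ids.
    (Generic sketch; the paper's auxiliary target differs.)"""
    main = F.cross_entropy(main_logits, main_targets)
    aux = F.cross_entropy(aux_logits, aux_targets)
    return main + lam * aux
```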
The article discusses how Cloudflare's Page Shield effectively mitigated the npm supply chain attack that compromised 18 popular packages, preventing attackers from stealing cryptocurrency and other sensitive information. Utilizing advanced machine learning techniques, Cloudflare assesses billions of scripts daily to identify and block malicious code, ensuring enhanced security for users.
The article describes Ovi, a video and audio generation model developed by Character AI that can create synchronized content from text or text+image inputs. It highlights its features such as high-quality audio, flexible input options, and support for various resolutions, along with links to demos and installation guidance. The project aims to enhance video creation capabilities while maintaining temporal and spatial consistency.
The article discusses load balancing in MoE (Mixture of Experts) models, explaining why keeping experts evenly utilized matters for resource allocation and performance in machine learning tasks, and outlining techniques for balancing load effectively to improve the efficiency of these models.
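One widely used technique in this space (the article may cover others) is the Switch-Transformer-style auxiliary balancing loss, which penalizes routers that concentrate tokens on a few experts:

```python
import torch

def load_balancing_loss(router_probs, expert_index, num_experts):
    """Switch-Transformer-style auxiliary loss.
    router_probs: [tokens, experts] softmax outputs of the router.
    expert_index: [tokens] index of the expert each token was sent to.
    Minimized when routing is uniform across experts."""
    one_hot = torch.nn.functional.one_hot(expert_index, num_experts).float()
    # f_i: fraction of tokens actually dispatched to each expert.
    tokens_per_expert = one_hot.mean(dim=0)
    # P_i: mean router probability assigned to each expert.
    mean_probs = router_probs.mean(dim=0)
    return num_experts * torch.sum(tokens_per_expert * mean_probs)
```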
The article recounts a bug encountered while using PyTorch, where a GPU kernel issue on Apple Silicon caused a training loss to plateau unexpectedly. The author details the investigative process of identifying the bug, which involved digging into PyTorch internals and debugging steps that illuminate the framework's complexity. The experience ultimately gave the author a deeper understanding of PyTorch than years of regular use had.
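A common first step when a backend kernel is suspect, and one any reader can reuse, is to run the same op on CPU and on the accelerator and compare outputs. The helper below is a generic sketch (the op and tolerance are arbitrary examples, not from the article):

```python
import torch

def check_backend_parity(fn, *cpu_tensors, device="mps", atol=1e-5):
    """Run the same op on CPU and on the suspect backend; a large
    mismatch points at a kernel bug rather than at the model."""
    cpu_out = fn(*cpu_tensors)
    dev_out = fn(*(t.to(device) for t in cpu_tensors)).cpu()
    max_diff = (cpu_out - dev_out).abs().max().item()
    print(f"max |cpu - {device}| = {max_diff:.3e}")
    return max_diff <= atol

if torch.backends.mps.is_available():
    x = torch.randn(64, 64)
    check_backend_parity(torch.nn.functional.gelu, x)
```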